πŸ•ΈοΈ Ada Research Browser

model-strategy.md
← Back

Model Strategy β€” The Ford Estate

Updated: 2026-03-01

Philosophy

Never pay for what you can get free. Never use a sledgehammer when a scalpel works.

The goal: route every request to the cheapest/fastest model that can handle it, spread usage across providers to avoid hitting limits, and reserve paid models for tasks that genuinely need them.

Provider Tiers (by cost)

Tier 0 β€” FREE (use first, always)

Provider Best Models Limits Best For
NVIDIA NIM GLM 5, Kimi K2.5, DeepSeek V3.2, Qwen3 Coder 480B 40 RPM Everything β€” S+ tier models for free
Cerebras GPT-OSS-120B, Qwen3-235B, GLM-4.7 30 RPM, 1M TPD Speed-critical tasks (fastest inference)
Groq Kimi K2, Llama 3.3 70B, Qwen3-32B 1K RPD, 12K TPM Quick lookups, triage
OpenRouter (free) 29 free models 50 req/day Overflow/fallback
Codestral codestral-latest 30 RPM, 2K RPD Code generation/completion
SambaNova Llama 3.3 70B, DeepSeek V3 Free tier Fast inference fallback

Tier 1 β€” CHEAP (pennies per task)

Provider Best Models Cost Best For
Gemini Flash gemini-2.5-flash ~$0.075/M input Daily driver, coordination
Mistral Small mistral-small-latest Free (4M/month cap) European routing, fallback
Together AI Various open models $25 credit Batch work, coding

Tier 2 β€” MODERATE (use thoughtfully)

Provider Best Models Cost Best For
OpenAI GPT-4o-mini gpt-4o-mini ~$0.15/M input When OpenAI quality needed
Mistral Medium mistral-medium-latest Moderate Strong all-rounder
Gemini Pro gemini-2.5-pro ~$1.25/M input Long context, multimodal

Tier 3 β€” PREMIUM (last resort for complex tasks)

Provider Best Models Cost Best For
OpenAI GPT-4o gpt-4o ~$2.50/M input When you need OpenAI flagship
OpenAI o3 o3 ~$10/M input Deep reasoning
Anthropic Opus claude-opus-4-6 ~$15/M input Most complex reasoning, long agentic

Agent Assignments

Ada (Coordinator) β€” needs speed + awareness

K2 (Tech/Homelab) β€” needs coding + technical accuracy

Cora (Real Estate) β€” needs clarity + professionalism

Winston (Family Butler) β€” needs warmth + reliability

Synergy (Wife's Assistant) β€” needs warmth + privacy

Subagents / Cron / Overnight

Load Balancing Strategy

Daily Token Budget (rough targets)

Limit Avoidance

When to Escalate

Pending Activations